Picture for Haitham Bou Ammar

Haitham Bou Ammar

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks

Add code
Feb 05, 2026
Viaarxiv icon

Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening

Add code
Jan 29, 2026
Viaarxiv icon

Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving

Add code
Jul 03, 2025
Viaarxiv icon

Almost Surely Safe Alignment of Large Language Models at Inference-Time

Add code
Feb 03, 2025
Viaarxiv icon

Efficient Reinforcement Learning with Large Language Model Priors

Add code
Oct 10, 2024
Figure 1 for Efficient Reinforcement Learning with Large Language Model Priors
Figure 2 for Efficient Reinforcement Learning with Large Language Model Priors
Figure 3 for Efficient Reinforcement Learning with Large Language Model Priors
Figure 4 for Efficient Reinforcement Learning with Large Language Model Priors
Viaarxiv icon

Mixture of Attentions For Speculative Decoding

Add code
Oct 04, 2024
Viaarxiv icon

Group Robust Preference Optimization in Reward-free RLHF

Add code
May 30, 2024
Figure 1 for Group Robust Preference Optimization in Reward-free RLHF
Figure 2 for Group Robust Preference Optimization in Reward-free RLHF
Figure 3 for Group Robust Preference Optimization in Reward-free RLHF
Figure 4 for Group Robust Preference Optimization in Reward-free RLHF
Viaarxiv icon

Framework and Benchmarks for Combinatorial and Mixed-variable Bayesian Optimization

Add code
Jun 16, 2023
Viaarxiv icon

End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes

Add code
May 25, 2023
Figure 1 for End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes
Figure 2 for End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes
Figure 3 for End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes
Figure 4 for End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes
Viaarxiv icon

Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions

Add code
May 16, 2023
Figure 1 for Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Figure 2 for Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Figure 3 for Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Figure 4 for Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Viaarxiv icon